A New and Efficient Alignment Technique by Cosine Distance
Authors
Abstract
In this paper we describe a new technique for measuring the similarity, or distance, between time series, which we call the Alignment Technique by Cosine Distance (ATCD). An important feature of the technique is that it requires neither a priori knowledge of the time series nor a training stage. ATCD is based on cosine distance and least squares, and takes as its parameter the dimension of two support vectors. When these vectors are high-dimensional, ATCD achieves its best performance, yielding the smallest possible measure of similarity (distance). ATCD can be used in applications such as medical signal processing and audio and speech recognition, among others. We demonstrated ATCD's efficiency on an isolated-word speech recognition system by comparing it against Dynamic Time Warping.
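The abstract does not spell out the ATCD procedure, so the sketch below is not ATCD itself. It only illustrates two ingredients the abstract names: the cosine distance between two equal-length vectors, and classic Dynamic Time Warping as the baseline it is compared against. The function names `cosine_distance` and `dtw_distance`, and the absolute-difference local cost used in the DTW, are assumptions made for illustration.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length numeric vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_u * norm_v)


def dtw_distance(x, y):
    """Classic DTW between two 1-D sequences (assumed local cost: |a - b|)."""
    n, m = len(x), len(y)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning x[:i] with y[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(x[i - 1] - y[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # x advances, y[j-1] is reused
                cost[i][j - 1],      # y advances, x[i-1] is reused
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]
```

Unlike DTW, cosine distance needs vectors of equal length, which may be why the abstract introduces fixed-dimension support vectors; that reading is a guess, not something the abstract states.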
Similar resources
One Size Fits All? A Simple Technique to Perform Several NLP Tasks
Word fragments or n-grams have been widely used to perform different Natural Language Processing tasks such as information retrieval [1] [2], document categorization [3], automatic summarization [4] or, even, genetic classification of languages [5]. All these techniques share some common aspects such as: (1) documents are mapped to a vector space where n-grams are used as coordinates and their ...
Hyperbolic Cosine Log-Logistic Distribution and Estimation of Its Parameters by Using Maximum Likelihood Bayesian and Bootstrap Methods
In this paper, a new probability distribution, based on the family of hyperbolic cosine distributions is proposed and its various statistical and reliability characteristics are investigated. The new category of HCF distributions is obtained by combining a baseline F distribution with the hyperbolic cosine function. Based on the base log-logistics distribution, we introduce a new di...
gpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences
Bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. Many computational algorithms have been applied for solving the sequence alignment problem. Dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods appli...
Improvement Tfidf for News Document Using Efficient Similarity
This study proposes a new method for document clustering. Clustering is a very powerful data mining technique for topic discovery from documents. In document clustering, there must be more similarity among documents within a cluster and less similarity between documents of two different clusters. The cosine function measures the similarity of two documents. When the clusters are not well separated, partiti...
Quick and Reliable Document Alignment via TF/IDF-weighted Cosine Distance
This work describes our submission to the WMT16 Bilingual Document Alignment task. We show that a very simple distance metric, namely the cosine distance of tf/idf-weighted document vectors, provides a quick and reliable way to align documents. We compare many possible variants for constructing the document vectors. We also introduce a greedy algorithm that runs quicker and performs better in practi...
Journal title:
- IJCOPI
Volume 4, Issue
Pages -
Publication date: 2013